Ending | Count |
---|---|
). | 96 |
". | 74 |
יותר. | 54 |
זה. | 47 |
שלו. | 43 |
הברית. | 38 |
זו. | 33 |
שנים. | 32 |
אותו. | 31 |
ישראל. | 30 |
רבים. | 28 |
בלבד. | 26 |
שונים. | 26 |
בו. | 25 |
ביותר. | 25 |
נוספים. | 25 |
אחרים. | 24 |
העיר. | 23 |
אחד. | 22 |
הארץ. | 21 |
שנה. | 21 |
שלה. | 21 |
העולם. | 20 |
בישראל. | 19 |
אותם. | 19 |
המדינה. | 19 |
המלחמה. | 18 |
בעולם. | 17 |
שונות. | 17 |
היום. | 17 |
The next four subsections show the most frequent sentence endings consisting of N words, for N = 1, 2, 3, 4. This subsection starts with N = 1.
The most frequent word-N-grams at the end of sentences give some insight into sentence composition.
Especially for N=1, only a small corpus is needed to identify the most frequent sentence endings.
select substring_index(sentence, ' ', -1) as ending, count(*) as cnt from sentences group by substring_index(sentence, ' ', -1) order by cnt desc limit 50;
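For readers without database access, the same counting can be sketched in Python. This is a minimal illustration, not the code used to produce the table: the function name `ending_counts`, its parameters, and the sample sentences are hypothetical. It assumes sentences are plain strings whose words are separated by single spaces, as the query does.

```python
from collections import Counter


def ending_counts(sentences, n=1, limit=50):
    """Return the `limit` most frequent endings made of the last n words.

    Hypothetical helper; n is assumed to be at least 1.
    """
    counts = Counter()
    for sentence in sentences:
        words = sentence.split(" ")
        # Like substring_index(sentence, ' ', -n): a sentence with fewer
        # than n words is kept whole.
        ending = " ".join(words[-n:])
        counts[ending] += 1
    # Sorted by descending count, like "order by cnt desc limit 50".
    return counts.most_common(limit)


# Made-up sample sentences, for illustration only.
sample = ["a b c.", "x y c.", "p q r.", "c."]
top = ending_counts(sample, n=1)
```

Calling `ending_counts(sample, n=2)` counts two-word endings instead, which corresponds to passing -2 as the last argument of `substring_index` in the query.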
4.3.1.1 Most Frequent Sentence Beginnings I
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV